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Abstract 

Kramers' law describes the mean transition time of an overdamped Brownian par- 
ticle between local minima in a potential landscape. We review different approaches 
that have been followed to obtain a mathematically rigorous proof of this formula. We 
also discuss some generalisations, and a case in which Kramers' law is not valid. This 
review is written for both mathematicians and theoretical physicists, and endeavours 
to link concepts and terminology from both fields. 
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1 Introduction 

The overdamped motion of a Brownian particle in a potential V is governed by a first-order 
Langevin (or Smoluchowski) equation, usually written in the physics literature as 



x = -VV(x) + V2£6, (1.1) 

where & denotes zero-mean, delta-correlated Gaussian white noise. We will rather adopt 
the mathematician's notation, and write (jl.ip as the Ito stochastic differential equation 

dx t = -VV(xt)dt + V2edWt , (1.2) 

where Wt denotes d-dimensional Brownian motion. The potential is a function V : R d — > 
R , which we will always assume to be smooth and growing sufficiently fast at infinity. 

The fact that the drift term in (II. 2D has gradient form entails two important properties, 
which greatly simplify the analysis: 

1. There is an invariant probability measure, with the explicit expression 

H(dx) = \ e ~ v ^/ £ dx , (1.3) 

where Z is the normalisation constant. 

2. The system is reversible with respect to the invariant measure jx, that is, the transition 
probability density satisfies the detailed balance condition 

p(y, t\x, 0) e~ v ^/ E = p(x, t\y, 0) e~ v W e . (1.4) 
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Figure 1. Graph of a potential V in dimension d = 
and saddle z*. 



y* 

2, with two local minima x* and y* 



The main question we are interested in is the following. Assume that the potential V 
has several (meaning at least two) local minima. How long does the Brownian particle 
take to go from one local minimum to another one? 

To be more precise, let x* and y* be two local minima of V, and let B$(y*) be the ball 
of radius 5 centred in y*, where 5 is a small positive constant (which may possibly depend 
on e). We are interested in characterising the first-hitting time of this ball, defined as the 
random variable 



The two points x* and y* being local minima, the potential along any continuous path 7 
from x* to y* must increase and decrease again, at least once but possibly several times. 
We can determine the maximal value of V along such a path, and then minimise this value 
over all continuous paths from x* to y* . This defines a communication height 



Although there are many paths realising the infimum in (]1.6p . the communication height 
is generically reached at a unique point z* , which we will call the relevant saddle between 
x* and y*. In that case, H(x*,y*) = V(z*) (see Figured]). One can show that generically, 
z* is a critical point of index 1 of the potential, that is, when seen from z* the potential 
decreases in one direction and increases in the other d — 1 directions. This translates 
mathematically into VV(z*) = and the Hessian V 2 V~(z*) having exactly one strictly 
negative and d — 1 strictly positive eigenvalues. 

In order to simplify the presentation, we will state the main results in the case of a 
double- well potential, meaning that V has exactly two local minima x* and y*, separated 
by a unique saddle z* (Figure [[]) , henceforth referred to as "the double- well situation" . 
The Kramers law has been extended to potentials with more than two local minima, and 
we will comment on its form in these cases in Section [3.31 below. 

In the context of chemical reaction rates, a relation for the mean transition time r^w 
was first proposed by van t'Hoff, and later physically justified by Arrhenius j Arr8 9] . It 
reads 



Ty* = in£{t > 0: x t G Bs(y*)} where xq = x* . 
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The Eyring-Kramers law |Eyr35[ !Kra 40j is a refinement of Arrhenius' law, as it gives an 
approximate value of the prefactor C in (jl.7p . Namely, in the one-dimensional case d = 1, 
it reads 

E{r -; } ~ 27r e \y{**)-v(x*)]/e (L8) 

1 " ' y/V"(x*)\V"(z*)\ 

that is, the prefactor depends on the curvatures of the potential at the starting minimum 
x* and at the saddle z*. Smaller curvatures lead to longer transition times. 
In the multidimensional case d ^ 2, the Eyring-Kramers law reads 



EW X *\ ~ 2tt / |det(V2F( z *))| [viz *y v{x * )]/e 

|A a (2*)|y det(V 2 F(**)) ' 1 J 

where Ai(z*) is the single negative eigenvalue of the Hessian V 2 V(z*). If we denote the 
eigenvalues of V 2 V(z*) by AiO*) < < \ 2 (z*) < • • • < A d (z*), and those of V 2 F(x*) by 
< Xi(x*) ^ • • • ^ \d(x*), the relation (jl.lOp can be rewritten as 



which indeed reduces to (jl.8p in the case d = 1. Notice that for d ^ 2, smaller curvatures 
at the saddle in the stable directions (a "broader mountain pass") decrease the mean 
transition time, while a smaller curvature in the unstable direction increases it. 

The question we will address is whether, under which assumptions and for which 
meaning of the symbol ~ the Eyring-Kramers law (jl.9p is true. Answering this question 
has taken a surprisingly long time, a full proof of (]1.9p having been obtained only in 
2004 [BEGK04] . 

In the sequel, we will present several approaches towards a rigorous proof of the Arrhe- 
nius and Eyring-Kramers laws. In Section [21 we present the approach based on the theory 
of large deviations, which allows to prove Arrhenius' law for more general than gradient 
systems, but fails to control the prefactor. In Section [3l we review different analytical 
approaches, two of which yield a full proof of (jl.9p . Finally, in Section HI we discuss some 
situations in which the classical Eyring-Kramers law does not apply, but either admits a 
generalisation, or has to be replaced by a different expression. 
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2 Large deviations and Arrhenius' law 



The theory of large deviations has applications in many fields of probability [DZ98, DS89]. 
It allows in particular to give a mathematically rigorous framework to what is known in 
physics as the path-integral approach, for a general class of stochastic differential equations 
of the form 

dx t = f(x t )dt + V^dW t , (2.1) 

where / need not be equal to the gradient of a potential V (it is even possible to consider 
an x-dependent diffusion coefficient y/2e g(xt) dWi). In this context, a large-deviation 
principle is a relation stating that for small e, the probability of sample paths being close 
to a function <p(t) behaves like 

P{x t ~ (f(t), < t < T} ~ e - /(t/,)/2e (2.2) 

(see (|2.4p below for a mathematically precise formulation). The quantity I(<p) = /[o j t]( < /') is 
called rate function or action functional. Its expression was determined by Schilder [Sch66j 
in the case / = of Brownian motion, using the Cameron-Martin-Girsanov formula. 
Schilder's result has been extended to general equations of the form (|2.ip by Wentzell and 
Freidlin [VF70], who showed that 

I(v) = l£\m)-f(<P(t))\\ 2 dt. (2.3) 

Observe that 1(f) is nonnegative, and vanishes if and only if ip(t) is a solution of the 
deterministic equation (p = /(</?). One may think of the rate function as representing the 
cost of tracking the function ip rather than following the deterministic dynamics. 

A precise formulation of (|2.2[) is that for any set Y of paths ip : [0, T] — > R rf , one has 

- mil < liminf 2elogP{(x t ) G Y} < limsup2elogP{(xi) G Y} < -infl . (2.4) 

~ ^ s — >o r 

For sufficiently well-behaved sets of paths Y, the infimum of the rate function over the 
interior Y° and the closure Y coincide, and thus 

lim 2s log P{ (xt) G T) = -infl . (2.5) 

£->o r 

Thus roughly speaking, we can write P{(x^) G Y} ~ e -infr//2e^ -^ n ^ we g^Quioi keep in 
mind that this is only true in the sense of logarithmic equivalence ()2.5[) . 

Remark 2.1. The large-deviation principle (|2.4p can be considered as an infinite-dimen- 
sional version of Laplace's method. In the finite-dimensional case of functions w : R d — > R , 
Laplace's method yields 

lim 2e log f e ~ w{x)/2e dx = -mfw , (2.6) 
e^o J T r 

and also provides an asymptotic expansion for the prefactor C(e) such that 

J e -w(x)/2e dx = c ^ e - inf r w/2e _ ^ _^ 

This approach can be extended formally to the infinite-dimensional case, and is often used 
to derive subexponential corrections to large-deviation results (see e.g. [MS93j). We are 
not aware, however, of this procedure being justified mathematically. 
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Figure 2. The setting of Theorems 12.21 and 12.31 The domain V contains a unique stable 
equilibrium point x*, and all orbits of the deterministic system x = f{x) starting in T> 
converge to x* . 



Let us now explain how the large-deviation principle (|2,4p can be used to prove Ar- 
rhenius' law. Let x* be a stable equilibrium point of the deterministic system x = f{x). 
In the gradient case / = — VV, this means that x* is a local minimum of V. Consider a 
domain PcR^ whose closure is included in the domain of attraction of x* (all orbits of 
x = f{x) starting in V converge to x*, see Figure [2|). The quasipotential is the function 
defined for z £ V by 

Viz) = inf inf I(<p) . (2.8) 

T>0 <p: V (0)=x* , V (T)=z 

It measures the cost of reaching z in arbitrary time. 

Theorem 2.2 ( |VF69[ IVF70] ). Let r = inf{t > 0: x t V] denote the first-exit time of 
xt from V. Then for any initial condition xq S V, we have 

lim 2e log E xo {r} = inf V(z) =: V . (2.9) 

Sketch of proof. First one shows that for any xo € T>, it is likely to hit a small neigh- 
bourhood of x* in finite time. The large-deviation principle shows the existence of a time 
T > 0, independent of e, such that the probability of leaving V in time T is close to 
p = e~ v / 2e . Using the Markov property to restart the process at multiples of T, one 
shows that the number of time intervals of length T needed to leave T> follows an approx- 
imately geometric distribution, with expectation 1/p = e y l 2e (these time intervals can be 
viewed as repeated "attempts" of the process to leave T>). The errors made in the different 
approximations vanish when taking the limit (12. 9p . □ 



Wentzell and Freidlin also show that if the quasipotential reaches its minimum on dT> 
at a unique, isolated point, then the first-exit location x T concentrates in that point as 
e — > 0. As for the distribution of r, Day has shown that it is asymptotically exponential: 

Theorem 2.3 (|Day83|). In the situation described above, 

limP{r > sE{r}} = e~ s (2.10) 

for all s > 0. 
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In general, the quasipotential V has to be determined by minimising the rate func- 
tion (|2.3p . using either the Euler-Lagrange equations or the associated Hamilton equations. 
In the gradient case / = — VV, however, a remarkable simplification occurs. Indeed, we 
can write 



1 



T 



Jfo0 = 2 / ||0(t)+W(v(*))rdf 



o 



T r T 
2 



\\v(t)-vv&(t))\\ 2 at + 2 / (<p(t),vv(tp(t)))dt 

o io 

||^(t)-vy(^(t))|| 2 di + 2[y(^(r))-y(^(o))] . (2.ii) 



o 



The first term on the right-hand vanishes if <p(t) is a solution of the time-reversed deter- 
ministic system <p = +W(y?). Connecting a local minimum x* to a point in the basin of 
attraction of x* by such a solution is possible, if one allows for arbitrarily long time. Thus 
it follows that the quasipotential is given by 



V = 2 



miV-V{x*) . (2.12) 
L&D 



Corollary 2.4. In the double-well situation, 

lime log E{r w) } = VV) - . (2.13) 

Sketch of proof. Let T> be a set containing x*, and contained in the basin of attraction 
of x*. One can choose T> in such a way that its boundary is close to z*, and that the 
minimum of V on &D is attained close to z*. Theorem 12.21 and (|2.12j) show that a relation 
similar to fj2 . 13f) holds for the first-exit time from T>. Then one shows that once xt has 
left T>, the average time needed to hit a small neighbourhood of y* is negligible compared 
to the expected first-exit time from T>. □ 

Remark 2.5. 

1. The case of more than two stable equilibrium points (or more general attractors) can 
be treated by organising these points in a hierarchy of "cycles" , which determines the 
exponent in Arrhenius' law and other quantities of interest. See [FW98| [FreOO| . 

2. As we have seen, the large-deviations approach is not limited to the gradient case, but 
also allows to compute the exponent for irreversible systems, by solving a variational 
problem. However, to our knowledge a rigorous computation of the prefactor by this 
approach has not been achieved, as it would require proving that the large-deviation 
functional I also yields the correct subexponential asymptotics. 
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Figure 3. Symmetric random walk on Z with two absorbing sets A, B. 

3 Analytic approaches and Kramers' law 

The different analytic approaches to a proof of Kramers' law are based on the fact that 
expected first-hitting times, when considered as a function of the starting point, satisfy 
certain partial differential equations related to Feynman-Kac formulas. 

To illustrate this fact, we consider the case of the symmetric simple random walk on 
Z . Fix two disjoint sets A, B C Z , for instance of the form A = (—00, a] and B = [b, 00) 
with a < b (Figure [3]). A first quantity of interest is the probability of hitting A before B, 
when starting in a point x between A and B: 

h A>B (x) = F x {t a < r B } . (3.1) 

For reasons that will become clear in Section 13.31 h AjB is called the equilibrium potential 
between A and B. Using the Markov property to restart the process after the first step, 
we can write 

h A>B (x) = P x {t a <Tb ,X 1 = x + 1} + P x {t a <t b ,X x =x- 1} 
= P x {t a <t b \X 1 =x + l}P x {Xt =x + l} 

+ ¥ x {t a <t b \X 1 = x- = x - 1} 

= h A , B {x + l)-\ + h A , B {x-l)-\. (3.2) 

Taking into account the boundary conditions, we see that h AyB {x) satisfies the linear 
Dirichlet boundary value problem 

Ah A>B (x) = , X£(AUB) C , 
h A , B (x) = 1 , x £ A , 

h AB (x) = 0, xeB, (3.3) 

where A denotes the discrete Laplacian 

(Ah)(x) = h(x - 1) - 2h(x) + h(x + 1) . (3.4) 

A function h satisfying Ah = is called a (discrete) harmonic function. In this one- 
dimensional situation, it is easy to solve ()3. 3|) : h AB is simply a linear function of x 
between A and B. 

A similar boundary value problem is satisfied by the mean first-hitting time of A, 
w A {x) = M x {t a }, assuming that A is such that the expectation exist (that is, the random 
walk on A c must be positive recurrent). Here is an elementary computation (a shorter 
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derivation can be given using conditional expectations): 



w A (x) = Y j k¥ x {r A = k} 



k 



E^ 1 



^- 1 {r A = k-l} + \¥ x+l {r A = k-l} 



= E( £ + !) [l pa:-1 {^ =i} + ^ x+1 {r A = £} 

t 

= \w A {x-l) + \w A {x + l) + l . (3.5) 

In the last line we have used the fact that t a is almost surely finite, as a consequence of 
positive recurrence. It follows that w A (x) satisfies the Poisson problem 

\Aw A {x) = -1 , x £ A c , 

w A (x) = , x E A . (3.6) 

Similar relations can be written for more general quantities of the form K x {e ArA 1{t j4 <t s }}- 
In the case of Brownian motion on M d , the probability h A ^{x) of hitting a set A before 
another set B satisfies the Dirichlet problem 

\Ah AiB (x) = , xe(AuB) c , 
h A ,B{x) = 1 , x £ A , 

h A , B (x) = , x G B , (3.7) 

where A now denotes the usual Laplacian in M d , and the expected first-hitting time of A 
satisfies the Poisson problem 

±Aw A (x) = -1 , x G A c , 

= , xeA. (3.8) 

For more general diffusions of the form 

dx t = -VV(x t ) dt + V2e dW t , (3.9) 

Dynkin's formula |Dyn65 |0ks85| shows that similar relations as (|3.7p . (|3.8p hold, with 
^A replaced by the infinitesimal generator of the diffusion, 

L = eA- VV(x) • V . (3.10) 

Note that L is the adjoint of the operator appearing in the Fokker-Planck equation, 
which is more familiar to physicists. Thus by solving a boundary value problem involving 
a second-order differential operator, one can in principle compute the expected first-hitting 
time, and thus validate Kramers' law. This turns out to be possible in the one-dimensional 
case, but no general solution exists in higher dimension, where one has to resort to per- 
turbative techniques instead. 

Remark 3.1. Depending on the set A, Systems (|3.6p and (|3.7p need not admit a bounded 
solution, owing to the fact that the symmetric random walk and Brownian motion are 
null recurrent in dimensions d = 1,2 and transient in dimensions d ^ 3. A solution 
exists, however, for sets A with bounded complement. The situation is less restrictive for 
diffusions in a confining potential V, which are usually positive recurrent. 
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Figure 4. Example of a one-dimensional potential for which Kramers' law (|3.15[) holds. 



3.1 The one-dimensional case 

In the case d = 1, the generator of the diffusion has the form 

(Lu)(x) = su"(x) - V'{x)u(x) , (3.11) 

and the equations for }ia,b{x) = F x {ta < tb} and wa{x) = E x {ta} can be solved 
explicitly. 

Consider the case where A = (— oo,a) and B = (6, oo) for some a < b, and x G (a, b). 
Then it is easy to see that the equilibrium potential is given by 

[ b e -v{y)/e dy 

h A , B (x) = ~T h • (3-12) 

r V{y)/e Ay 



(3.13) 



Laplace's method to lowest order shows that for small e, 



hA,B(x) ^ exp 



sup V — sup V 

.[a,b] [x,b] . 



As one expects, the probability of hitting A before B is close to 1 when the starting point 
x lies in the basin of attraction of a, and exponentially small otherwise. 
The expected first-hitting time of A is given by the double integral 

w A (x) = - / e [v ^- viy)]/£ dydz . (3.14) 

£ J a J z 

If we assume that x > y* > z* > a, where V has a local maximum in z* and a local 
minimum in y* (Figured]), then the integrand is maximal for (y, z) = (y*, z*) and Laplace's 
method yields exactly Kramers' law in the form 

{ta} = WA (x) = e iv^-v(v^ [1 + oWi)] . (3.15) 
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3.2 WKB theory 

The perturbative analysis of the infinitesimal generator (|3.10p of the diffusion in the limit 
e — > is strongly connected to semiclassical analysis. Note that L is not self-adjoint for 
the canonical scalar product, but as a consequence of reversibility, it is in fact self- adjoint 
in L 2 (M d , e~ v / £ dx). This becomes immediately apparent when writing L in the equivalent 
form 

L = ee v/£ V -e- v/£ V (3.16) 
(just write out the weighted scalar product). It follows that the conjugated operator 

L = e- y / 2£ Le v / 2£ (3.17) 

is self-adjoint in L 2 (R. d , dx). In fact, a simple computation shows that L is a Schrodinger 
operator of the form 

L = eA + -U (x) , (3.18) 

where the potential U is given by 

U(x) = \eAV(x)-\\\VV(x)\\ 2 . (3.19) 

Example 3.2. For a double-well potential of the form 

V(x) = \x" - \x 2 , (3.20) 

the potential U in the Schrodinger operator takes the form 

U{x) = -jx 2 (x 2 - l) 2 + h{x 2 - l) 2 . (3.21) 

Note that this potential has 3 local minima at almost the same height, namely two of 
them at ±1 where C/(±l) = and one at where f7(0) = e/2. 

One may try to solve the Poisson problem Lwa = — 1 by WKB-techniques in order 
to validate Kramers' formula. A closely related problem is to determine the spectrum 
of L. Indeed, it is known that if the potential V has n local minima, then L admits n 
exponentially small eigenvalues, which are related to the inverse of expected transition 
times between certain potential minima. The associated eigenfunctions are concentrated 
in potential wells and represent metastable states. 

The WKB-approach has been investigated, e.g., in [SM791 IBM881 IKM961 IMS97j . 
See [KolOO] for a recent review. A mathematical justification of this formal procedure 
is often possible, using hard analytical methods such as microlocal analysis }HS84j IHS85bl 
HS85a, HS85c], which have been developed for quantum tunnelling problems. The diffi- 
culty in the case of Kramers' law is that due to the form (I3.19P of the Schrodinger potential 
U, a phenomenon called "tunnelling through nonresonant wells" prevents the existence of 
a single WKB ansatz, valid in all R rf . One thus has to use different ansatzes in different 
regions of space, whose asymptotic expansions have to be matched at the boundaries, a 
procedure that is difficult to justify mathematically. 

Rigorous results on the eigenvalues of L have nevertheless been obtained with different 
methods in [HKS891 IMic95l IMat95j . but without a sufficiently precise control of their 
subexponential behaviour as would be required to rigorously prove Kramers' law. 
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Figure 5. Green's function Ga c {%, v) for Brownian motion is equal to the electrostatic 
potential in x created by a unit charge in y and a grounded conductor in A. 



3.3 Potential theory 

Techniques from potential theory have been widely used in probability theory [Kak45, 
Doo84, DS84, Szn98j. Although Wentzell may have had in mind its application to Kramers' 
law }Ven73| . this program has been systematically carried out only quite recently by Bovier, 
Eckhoff, Gayrard and Klein |BEGK04l IBGK05] . 

We will explain the basic idea of this approach in the simple setting of Brownian motion 
in W d , which is equivalent to an electrostatics problem. Recall that the first-hitting time 
ta of a set A C E d satisfies the Poisson problem (I3.6h . It can thus be expressed as 



w A {x) = - / G A c(x,y)dy, (3.22) 

where GA c (x,y) denotes Green's function, which is the formal solution of 

I Ait (a) = 5(x -y) , x e A c , 

u(x) = , xe A. (3.23) 

Note that in electrostatics, Ga c (x,v) represents the value at x of the electric potential 
created by a unit point charge at y, when the set A is occupied by a grounded conductor 
(Figure EJ)- 

Similarly, the solution Iia,b(x) = F x {ta < tb} of the Dirichlet problem (j3.7|) represents 
the electric potential at x, created by a capacitor formed by two conductors at A and B, 
at respective electric potential 1 and (Figure EJ- Hence the name equilibrium potential. 
If pa,b denotes the surface charge density on the two conductors, the potential can thus 
be expressed in the form 

h A ,B(x)= [ G B <{x,y)p A ,B{&y) • ( 3 - 24 ) 

JdA 

Note finally that the capacitor's capacity is simply equal to the total charge on either of 
the two conductors, given by 



cap A (B) 



PA,B{dy) 
dA 



(3.25) 
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FIGURE 6. The function Iia.b{x) — P x {ta < tb} is equal to the electric potential in x of 
a capacitor with conductors in A and B, at respective potential 1 and 0. 



The key observation is that even though we know neither Green's function, nor the 
surface charge density, the expressions (13.22p . (13.24j) and (13.25P can be combined to yield a 
useful relation between expected first-hitting time and capacity. Indeed, let C be a small 
ball centred in x. Then we have 

hc,A(y) d V = / G A c(y,z)pc,A(dz)dy 
a c Ja c Jac 



wa{z)pca{& z ) ■ (3.26) 

ac 

We have used the symmetry Ga c (v, z) = Ga c (z, y), which is a consequence of reversibility. 
Now since C is a small ball, if wa does not vary too much in C, the last term in (|3.26p 
will be close to wa(x) cap c (A). This can be justified by using a Harnack inequality, which 
provides bounds on the oscillatory part of harmonic functions. As a result, we obtain the 
estimate 

hc,A(y) d v 



■ (3.27) 

This relation is useful because capacities can be estimated by a variational principle. 
Indeed, using again the electrostatics analogy, for unit potential difference, the capacity 
is equal to the capacitor's electrostatic energy, which is equal to the total energy of the 
electric field V/i: 

cap A (B)= [ \\Vh A , B (x)\\ 2 dx. (3.28) 

J(AUB) C 

In potential theory, this integral is known as a Dirichlet form. A remarkable fact is that 
the capacitor at equilibrium minimises the electrostatic energy, namely, 

Ca p A (B)= inf / \\Vh(x)\\ 2 dx , (3.29) 

where Ha,b denotes the set of all sufficiently regular functions h satisfying the boundary 
conditions in (|3.7p . Similar considerations can be made in the case of general reversible 
diffusions of the form 

dx t = -VV(x t ) dt + V2e dW t , (3.30) 
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a crucial point being that reversibility implies the symmetry 

e -v(*)/e G A c(x,y) =e- v ^GMy,x) . (3.31) 
This allows to obtain the estimate 

E *^l = = ~ cap c (.4) • (3 ' 32) 

where the capacity is now defined as 

cap A (B)= inf / ||V/i(x)|| 2 e- y(:c)/£ dx . (3.33) 
hen A ,B J(aub) c 

The numerator in (|3.32p can be controlled quite easily. In fact, rather rough a priori 
bounds suffice to show that if x* is a potential minimum, then he, a is exponentially close 
to 1 in the basin of attraction of x*. Thus by straightforward Laplace asymptotics, we 
obtain 

p (n--\d/2 „-V(x*)/e 

/ h c ,A(y)e- v ^dy= 1 ^=— = -[l + 0{y/i\]oge\)] . (3.34) 
JA C y/det[\/ d V(x*)) 

Note that this already provides one "half" of Kramers' law (|1.9|) . The other half thus 
has to come from the capacity cap c (A), which can be estimated with the help of the 
variational principle (|3. 33|) . 

Theorem 3.3 ([BEGK04]). In the double-well situation, Kramers' law holds in the sense 
that 

where B e (y*) is the ball of radius e (the same e as in the diffusion coefficient) centred 
in y*. 

Sketch of proof. In view of (13. 32ft and (13.34 j) . it is sufficient to obtain sharp upper 
and lower bounds on the capacity, of the form 



The variational principle f|3.33j) shows that the Dirichlet form of any function h G Ha,b 
provides an upper bound on the capacity. It is thus sufficient to construct an appropriate 
h. It turns out that taking h(x) = h\(xx), depending only on the projection x\ of x on the 
unstable manifold of the saddle, with h\ given by the solution (I3.12P of the one-dimensional 
case, does the job. 

The lower bound is a bit more tricky to obtain. Observe first that restricting the 
domain of integration in the Dirichlet form (13.331) to a small rectangular box centred in 
the saddle decreases the value of the integral. Furthermore, the integrand ||V/i(x)|| 2 is 
bounded below by the derivative in the unstable direction squared. For given values of 
the equilibrium potential h^,B on the sides of the box intersecting the unstable manifold 
of the saddle, the Dirichlet form can thus be bounded below by solving a one-dimensional 
variational problem. Then rough a priori bounds on the boundary values of h^,B yield 
the result. □ 
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FIGURE 7. Example of a three- well potential, with associated metastable hierarchy. The 
relevant communication heights are given by H^xi,, {xi, x%}) = hi and H(x\,x\) = h\. 

Remark 3.4. For simplicity, we have only presented the result on the expected transition 
time for the double-well situation. Results in [BEGK04, BGKQ5] also include the following 
points: 

1. The distribution of tq e ^ is asymptotically exponential, in the sense of (|2. 10|) . 

2. In the case of more than 2 local minima, Kramers' law holds for transitions between 
local minima provided they are appropriately ordered. See Example 13.51 below. 

3. The small eigenvalues of the generator L can be sharply estimated, the leading terms 
being equal to inverses of mean transition times. 

4. The associated eigenfunctions of L are well-approximated by equilibrium potentials 
Ha,b for certain sets A, B. 

If the potential V has n local minima, there exists an ordering 

x* -< x\ < < x* (3.37) 

such that Kramers' law holds for the transition time from each to the set = 

{x\, . . . , Xfc}. The ordering is defined in terms of communication heights by the condition 

H(x%,Mk-i) «S min H(x*, M k \x*)-6 (3.38) 

i<k 

for some > 0. This means that the minima are ordered from deepest to shallowest. 

Example 3.5. Consider the three-well potential shown in Figure [71 The metastable 
ordering is given by 

xg -< x\ -< x* 2 , (3.39) 

and Kramers' law holds in the form 

E x *{t 3 } ~ C ie hl/e , E x 2{t {1i3} } ~ C 2 e h2/E , (3.40) 

where the constants C\,C<z depend on second derivatives of V. However, it is not true 
that E^a {r 3 } ~ C2 e h2 ^ £ . In fact, E^a {r 3 } is rather of the order e H / £ . This is due to the 
fact that even though when starting in x\^ the process is very unlikely to hit x* before x\ 
(this happens with a probability of order e~( hl ~ H ^ £ ), this is overcompensated by the very 
long waiting time in the well x\ (of order e ftl//e ) in case this happens. 
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3.4 Witten Laplacian 

In this section, we give a brief account of another successful approach to proving Kramers' 
law, based on WKB theory for the Witten Laplacian. It provides a good example of the 
fact that problems may be made more accessible to analysis by generalising them. 

Given a compact, d-dimensional, orientable manifold M, equipped with a smooth 
metric g, let QP{M) be the set of differential forms of order p on M . The exterior derivative 
d maps a p-form to a (p + l)-form. We write d^ for the restriction of d to Q P (M). The 
sequence 

-> n°(Af) -^-> X (M) . . . ► O d (M) -^-4 (3.41) 

is called the de Rham complex associated with M. 

Differential forms in the image im are called exact, while differential forms in the 

kernel ker d^ are called closed. Exact forms are closed, that is, d^od^ = or in short 
d 2 = 0. However, closed forms are not necessarily exact. Hence the idea of considering 
equivalence classes of differential forms differing by an exact form. The vector spaces 

ker d ( p ) 

H p (M) = — -— (3.42) 

are thus not necessarily trivial, and contain information on the global topology of M. 
They form the so-called de Rham cohomology. 

The metric g induces a natural scalar product (v)p on Q P (M) (based on the Hodge 
isomorphism *). The codifferential on M is the formal adjoint d* of d, which maps (p + 1)- 
forms to p-forms and satisfies 

(d u, r/) p+1 = (uj, d* r]) p (3.43) 

for all u G Q, P (M) and r\ £ f2 p+1 (M). The Hodge Laplacian is defined as the symmetric 
non- negative operator 

A H = dd*+d*d= (d + d*) 2 , (3.44) 

and we write for its restriction to £l p . In the Euclidean case M = R d , using integration 
by parts in (13, 43ft shows that 

Ag } = -A , (3.45) 

where A is the usual Laplacian. Differential forms 7 in the kernel H P A (M) = ker A^ } 
are called p-harmonic forms. They are both closed (d7 = 0) and co-closed (d* 7 = 0). 
Hodge has shown (see, e.g. |GH94I ) that any differential form oj G £l p (M) admits a unique 
decomposition 

w = da + d*/3 + 7, (3.46) 

where 7 is p-harmonic. As a consequence, %^(M) is isomorphic to the pth. de Rham 
cohomology group H P (M). 

Given a potential V : M — > R , the Witten Laplacian is defined in a similar way as the 
Hodge Laplacian by 

A V:£ = d Vt£ d* Vj£ + d^ £ d v ,e , (3.47) 
where dy, £ denotes the deformed exterior derivative 

d v , £ = ee- v / 2£ de v / 2£ . (3.48) 
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As before, we write Ay^. for the restriction of Ay j£ to Q P (M). A direct computation shows 
that in the Euclidean case M = 

A { °1 = -e 2 A + ^\\VV\\ 2 -^eAV , (3.49) 

which is equivalent, up to a scaling, to the Schrodinger operator ()3.18p . 

The interest of this approach lies in the fact that while eigenfunctions of Ay^. are 
concentrated near local minima of the potential V, those of Ay^ for p ^ 1 are concentrated 
near saddles of index p oi V. This makes them easier to approximate by WKB theory. 
The intertwining relations 

4^dg = dgAg, (3.50) 

which follow from d 2 = 0, then allow to infer more precise information on the spectrum 
of Ay^., and hence of the generator L of the diffusion |HN05| . 

This approach has been used by Helffer, Klein and Nier [HKN04] to prove Kramers' 
law (|1.9p with a full asymptotic expansion of the prefactor C = C(s), and in [HN06] to 
describe the case of general manifolds with boundary. General expressions for the small 
eigenvalues of all p-Laplacians have been recently derived in |LPNVli~] . 



4 Generalisations and limits 

In this section, we discuss two generalisations of Kramers' formula, and one irreversible 
case, where Arrhenius' law still holds true, but the prefactor is no longer given by Kramers' 
law. 



4.1 Non-quadratic saddles 

Up to now, we have assumed that all critical points are quadratic saddles, that is, with a 
nonsingular Hessian. Although this is true generically, as soon as one considers potentials 
depending on one or several parameters, degenerate saddles are bound to occur. See for 
instance [BFG07a, BFG07bJ for a natural system displaying many bifurcations involving 
nonquadratic saddles. Obviously, Kramers' law (I1.9P cannot be true in the presence of 
singular Hessians, since it would predict either a vanishing or an infinite prefactor. In fact, 
in such cases the prefactor will depend on higher-order terms of the Taylor expansion of 
the potential at the relevant critical points [Ste05| . The main problem is thus to determine 
the prefactor's leading term. 

There are two (non-exclusive) cases to be considered: the starting potential minimum 
x* or the relevant saddle z* is non-quadratic. The potential-theoretic approach presented 
in Section f3.3l provides a simple way to deal with both cases. In the first case, it is in fact 
sufficient to carry out Laplace's method for ()3.34p when the potential V has a nonquadratic 
minimum in x*, which is straightforward. 

We discuss the more interesting case of the saddle z* being non-quadratic. A general 
classification of non-quadratic saddles, based on normal-form theory, is given in [BG10J. 

Consider the case where in appropriate coordinates, the potential near the saddle 
admits an expansion of the form 

1 d 

V(y) = -u 1 (y 1 ) + u 2 (y 2 ,...,y k ) + - £ X jV ] + 0(\\y\\ r+1 ) , (4.1) 

j=k+l 



16 



for some r 2 and 2 ^ k ^ d. The functions ui and u 2 may take negative values in a small 
neighbourhood of the origin, of the order of some power of e, but should become positive 
and grow outside this neighbourhood. In that case, we have the following estimate of the 
capacity: 

Theorem 4.1 ([BG10]). There exists an explicit (3 > 0, depending on the growth of u\ 
and U2, such that in the double-well situation the capacity is given by 

e -My2,...,y k )/e dm dyh d ^ 



oo 

ui(yi)/e 



e- Ul(yi,/£ d Vl j=k+l 




J] xH 1 l + 0{eP\\oge\ l+ P) . (4.2 



oo 



We discuss one particular example, involving a pitchfork bifurcation. See [BG10] for 
more examples. 

Example 4.2. Consider the case k = 2 with 

ui(Vi) = — 2 l A ibi ' 

MV2) = \\2V\ + C A yl , (4.3) 

where Ai < and C4 > are bounded away from 0. We assume that the potential 
is even in y 2 . For A2 > 0, the origin is an isolated quadratic saddle. At A2 = 0, the 
origin undergoes a pitchfork bifurcation, and for A2 < 0, there are two saddles at 7/2 = 
i\/|^2|/4C4 + 0(\-i). Let Hi , . . . , fJ-d denote the eigenvalues of the Hessian of V at these 
saddles. 

The integrals in (|4.2|) can be computed explicitly, and yield the following prefactors in 
Kramers' law: 
• For A2 ^ 0, the prefactor is given by 



n( \_ 9 I (Aj + v / 2eC 4 )A 3 . . . Xd 1 . . 

[£) ~ V |Ai|det(V 2 F(^)) ^ + {\ 2 /^Cl)> [4A) 

where the function is bounded above and below by positive constants, and is given 
in terms of the modified Bessel function of the second kind ^1/4 by 

W-J^Sn-K^). (4.5, 
For A2 < 0, the prefactor is given by 



rfe \ _ o. / (P2 + \/2gC 4 )JJ& ■ ■ ■ Hd 1 f a n\ 

| w |det(VM^)) t.^/v^) ' ^ 

where the function is again bounded above and below by positive constants, and 
given in terms of the modified Bessel function of the first kind I±\u by 
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• (4 - 7) 
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FIGURE 8. The prefactor C(e) in Kramers' law when the potential undergoes a pitchfork 
bifurcation as the parameter A2 changes sign. The minimal value of C(e) has order e 1 / 4 . 



As long as A2 is bounded away from 0, we recover the usual Kramers prefactor. When 
IA2I is smaller than yfe, however, the term \/2eC± dominates, and yields a prefactor of 
order e 1 / 4 (see Figure [8]). The exponent 1/4 is characteristic of this particular type of 
bifurcation. 

The functions determine a multiplicative constant, which is close to 1 when A2 S> 
y/e, to 2 when A 2 <C -y/e, and to r(l/4)/(2 5 / 4 VvF) for |A 2 | < y/e. The factor 2 for large 
negative A2 is due to the presence of two saddles. 



4.2 SPDEs 



Metastability can also be displayed by parabolic stochastic partial differential equations 
of the form 

d t u(t, x) = d xx u{t, x) + f(u(t, x)) + s/2~eW tx , (4.8) 

where Wt x denotes space-time white noise (see, e.g. [Wal86| ). We consider here the simplest 
case where u(t, x) takes values in K , and x belongs to an interval [0, L], with either periodic 
or Neumann boundary conditions (b.c). Equation (|4.8p can be considered as an infinite- 
dimensional gradient system, with potential 

-L 



V\u] 



u'{xf + U{u{x)) 



dx 



(4.9) 



where U'(x) = —f(x). Indeed, using integration by parts one obtains that the Frechet 
derivative of V in the direction v is given by 

•L 

[u"(x) + f(u(x))} v(x) dx , (4.10) 



— V\u + riv] 
drj 



71=0 



which vanishes on stationary solutions of the deterministic system dtu = d xx u + f(u). 



In the case of the double- well potential U (u) 



l„,4 



I„2 
2 



the equivalent of Arrhenius' 



law has been proved by Faris and Jona-Lasinio |F JL82| . based on a large-deviation princi- 
ple. For both periodic and Neumann b.c, V admits two global minima u±(x) = ±1. The 
relevant saddle between these solutions depends on the value of L. For Neumann b.c, it 
is given by 

'0 ifLs:vr, 

(4.11) 



u (x) 



2m 
m+1 



sn (^+T +K ( m )' 



??? 



if L > it , 
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where 2\/m + 1 K(m) = L, K denotes the elliptic integral of the first kind, and sn denotes 
Jacobi's elliptic sine. There is a pitchfork bifurcation at L = tt. The exponent in Arrhenius' 
law is given by the difference ^[^o] — ^[ n -]> which can be computed explicitly in terms 
of elliptic integrals. 

The prefactor in Kramers' law has been computed by Maier and Stein, for various 
b.c, and L bounded away from the bifurcation value (L = tt for Neumann and Dirichlet 
b.c, L = 2vr for periodic b.c.) [MSOll IMS031 ISte04j . The basic observation is that the 
second-order Frechet derivative of V at a stationary solution u is the quadratic form 

(«i,«a) ^ (vi,Q[u]v 2 ) , (4.12) 

where 

Q[u]v{x) = -v"(x) - f'{u(x))v(x) . (4.13) 

Thus the role of the eigenvalues of the Hessian is played by the eigenvalues of the second- 
order differential operator Q[u], compatible with the given b.c. For instance, for Neumann 
b.c. and L < tt, the eigenvalues at the saddle u$ are of the form —1 + (Trk/L) 2 , k = 
0,1,2,..., while the eigenvalues at the local minimum it_ are given by 2 + (Trk/L) 2 , 
k = 0,1,2,.... Thus formally, the prefactor in Kramers' law is given by the ratio of 
infinite products 



C 




^2 3 /\J Sln ^ . (4.14) 
+ 2(L/nk) 2 Ysinh(V2L) V ' 

The determination of C for L > tt requires the computation of ratios of spectral de- 
terminants, which can be done using path-integral techniques (Gelfand's method, see 
also |For871 IMT95} ICdV99] for different approaches to the computation of spectral deter- 
minants). The case of periodic b.c. and L > 2tt is even more difficult, because there is a 
continuous set of relevant saddles owing to translation invariance, but can be treated as 
well |Ste04| . The formal computations of the prefactor have been extended to the case 
of bifurcations L ~ tt, respectively L ~ 2tt for periodic b.c. in }BG09j . For instance, for 
Neumann b.c. and L ^ tt, the expression (|4.14|) of the prefactor has to be replaced by 



V+{\x/y/$e/4L) V A i V sinh(V2L) 

where Ai = —1 + (tt/L) 2 . Unlike (I4.14D . which vanishes in L = tt, the above expression 
converges to a finite value of order L — > TT-. 

Putting these formal results on a rigorous footing is a challenging problem. A possible 
approach is to consider a sequence of finite-dimensional systems converging to the SPDE 
as dimension goes to infinity, and to control the dimension-dependence of the error terms. 
A step in this direction has been made in [BBM10] for the chain of interacting particles 
introduced in |BFG07a] . where a Kramers law with uniform error bounds is obtained for 
particular initial distributions. A somewhat different approach is to work with spectral 
Galerkin approximations of the SPDE [BBGllJ. 
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FIGURE 9. Two-dimensional vector field with an unstable periodic orbit. The location of 
the first exit from the domain T> delimited by the unstable orbit displays the phenomenon 
of cycling. 



4.3 The irreversible case 

Does Kramers' law remain valid for general diffusions of the form 

dx t = f{x t )dt + V^dW t , (4.16) 

in which / is not equal to the gradient of a potential VI In general, the answer is negative. 
As we remarked before, large-deviation results imply that Arrhenius' law still holds for 
such systems. The prefactor, however, can behave very differently as in Kramers' law. It 
need not even converge to a limiting value as e — > 0. 

We discuss here a particular example of such a non-Kramers behaviour, called cycling. 
Consider a two-dimensional vector field admitting an unstable periodic orbit, and let T> 
be the interior of the unstable orbit (Figured]). Since paths tracking the periodic orbit do 
not contribute to the rate function, the quasipotential is constant on dT>, meaning that 
on the level of large deviations, all points on the periodic orbit are equally likely to occur 
as first-exit points. 

Day has discovered the remarkable fact that the distribution of first-exit locations 
rotates around dT>, by an angle proportional to logs |Day90, Day94, Day96|. Hence this 
distribution does not converge to any limit as e — > 0. 

Maier and Stein provided an intuitive explanation for this phenomenon in terms of most 
probable exit paths and WKB-approximations [MS96]. Even though the quasipotential 
is constant on dT>, there exists a well-defined path minimising the rate function (except 
in case of symmetry-related degeneracies). This path spirals towards dT>, the distance to 
the boundary decreasing geometrically at each revolution. One expects that exit becomes 
likely as soon as the minimising path reaches a distance of order y/e from the boundary, 
which happens after a number of revolutions of order log e. 

It turns out that the distribution of first-exit locations itself has universal character- 
istics. The following result applies to a slightly simplified system obtained by linearising 
the dynamics around the periodic orbit. 

Theorem 4.3 ([BG04J). There exists an explicit parametrisation of &D by an angle 9 
(taking into account the number of revolutions), such that the distribution of first- exit 
locations has density 

p -(0-0o)/AT K 

P{9) = /transient^) ~ M^ 1 )) , (4.17) 
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where 

• /transient (#) is a transient term, exponentially close to 1 as soon as 6 S> |loge|; 

• T is the period of the unstable orbit, and A is its Lyapunov exponent; 

• Tk = Ce -1 / 2 e v / £ plays the role of Kramers' time; 

• the universal periodic function P\t{0) is a sum of shifted Gumbel distributions, given 
by 

Pxt(0) = Y, A (0~kXT) , A[x) = l e - 2a; -§ e ~ 2 * . (4.18) 

Although this result concerns the first-exit location, the first-exit time is strongly 
correlated with the first-exit location, and should thus display a similar behaviour. 

Another interesting consequence of this result is that it allows to determine the resi- 
dence-time distribution of a particle in a periodically perturbed double-well potential, and 
therefore gives a way to quantify the phenomenon of stochastic resonance [BG05]. 
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